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Abstract 



The problem of estimating the proportion of satisfiable instances of a given CSP (constraint sat- 
isfaction problem) can be tackled through weighting. It consists in putting onto each solution a 
non-negative real value based on its neighborhood in a way that the total weight is at least 1 for 
each satisfiable instance. We define in this paper a general weighting scheme for the estimation of 
satisfiability of general CSPs. First we give some sufficient conditions for a weighting system to 
be correct. Then we show that this scheme allows for an improvement on the upper bound on the 
existence of non-trivial cores in 3-SAT obtained by Maneva and Sinclair [lj to 4.419. Another more 
common way of estimating satisfiability is ordering. This consists in putting a total order on the 
domain, which induces an orientation between neighboring solutions in a way that prevents circuits 
from appearing, and then counting only minimal elements. We compare ordering and weighting 
5^ ■ under various conditions. 
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1. Introduction 



Constraint satisfaction problems cover a large variety of problems that arise in many areas of 
combinatorial optimization. They are central in complexity theory because they are iVP-complete 
and also because one particular case - satisfiability of boolean formulas - was the first problem to be 



in 

identified in this class. In general, they consist in defining constraints on a set of variables taking 
their values in a given finite domain. Constraints specify which combinations of values assigned 
to subsets of variables are allowed (or dually are forbidden). A solution is a valuation (i.e. the 
assignment of a value to each variable) that does not violate any constraint. The satisfiability 
problem is the following: given an instance, decide the existence of a solution for it. 

Besides the design of algorithms for solving these problems, the research of structural properties 
for these problems has attracted much attention in the recent years. In particular, the empirical 
evidence of the existence of a threshold (rigorously established in some particular cases) in the 
satisfiability of some classes of CSPs has opened a field of research: attempts are made to rigorously 
establish the existence and the location of this threshold. This involves estimating the proportion of 
satisfiable instances in a given set of instances. The A^P-completeness of these problems in general 
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makes it difficult to determine whether a given instance is satisfiable; that may explain why direct 
counting of satisfiable instances is currently unfeasible. However, precisely because these problems 
are in NP, it is easy to determine whether some instance is satisfied by a given valuation and 
then to count the formulas satisfied by this valuation. Thus counting couples (formulas, solutions) 
is only accessible starting from a solution; moreover, given a solution, it is not complicated to 
investigate also its immediate neighborhood. But even at a distance of 2, i.e. with neighbors of 
neighbors, calculations become quite complicated (see Kirousis et al. [2|]). This fact imposes a strong 
restriction on the design of both estimation techniques studied: they can only make use of local 
information. We shall refer to this as the locality condition. 

Using one of the most popular techniques in the probabilistic method (cf. Alon and Spencer [3|), 
namely the first moment method, it is possible to bound from above the probability of satisfiability. 
The implementation of the first moment method makes use of Markov's inequality; one needs to 
define a non-negative random variable X that must be at least 1 for a satisfiable formula (we call 
that a correct random variable). Ideally, X should be as small as possible; in other words, it 
should be for unsatisfiable instances and as close to 1 as possible for satisfiable ones (if X is 1 for 
every satisfiable instance and for every unsatisfiable instance then we get the exact probability of 
satisfiability). The most straightforward candidate for X is simply the number of solutions. The 
method consists in counting for every valuation the number of instances that are satisfied by it and 
then summing up over all valuations. But since the number of solutions is generally too large, the 
method over-estimates the proportion of satisfiable instances. 

Many techniques have been developed to overcome this difficulty in various types of CSPs, 
Satisfiability of CNF formulas (Kamath et al. [4], Dubois and Boufkhad [5], Kirousis et al. [2], Dubois 
et al. [3)121) Boufkhad et al. [8j, Kaporis et al. [9(, Diaz et al. [10(, Boufkhad and Hugel [111), 3- 
Coloring of graphs (Achlioptas and Molloy [12J]), Binary CSPs (Achlioptas et al. [l3J, [l4|). . . Most 



of these methods share a common point: they count minimal elements under some partial order 
over solutions. We will refer to this method as solution selection through a partial ordering or for 
short ordering. Due to the locality condition, the partial order must be locally computable (i.e. 
must depend only on the immediate neighbors of the considered solution) . Two solutions of some 
instance are neighbors if they disagree only on the value taken by one variable. Both solutions may 
be ordered using a predetermined order on the values for this particular variable in this particular 
instance. Finally we count only those solutions having minimal values for all their variables with 
respect to their neighbors. 

Recently, Maneva et al. [l5j introduced a novel approach for the boolean satisfiability problem 
consisting in weighting partial valuations and solutions depending on their neighborhood. While 
not originally intended to estimate the proportion of satisfiable instances (but rather to analyze 
some properties of Belief Propagation algorithms) , it was though specifically used by Maneva and 
Sinclair [l| to estimate the probability of existence of non-trivial cores in random 3-SAT instances. 
The existence of non-trivial cores contains an important information on the structure of the space of 
solutions; moreover it is related to the clustering that has been proved to exist in k-SAT for k > 9 
Achlioptas and Ricci-Tersenghi [l6j , Maneva and Sinclair [l| show that in the 3-SAT instances, 
non-trivial cores do not exist for ratios of clauses to variables greater that 4.453. To do so they use 
valid partial valuations (i.e. satisfying some properties related to boolean satisfiability) and weight 
them according to their values and their neighborhood. 



2. Overview of Results 

Our first result consists in giving some sufficient conditions to make a weighting scheme correct 
for the estimation of satisfiability on general CSPs (theorem [5) Weight Conservation Theorem). 
Then we propose a general weighting scheme obeying these conditions (theorem If2j) . This scheme 
is based on: 

1. a weighting seed that expresses the relative importance of each value with respect to a variable 
and an instance; the seed is such that if all valuations were solutions, then their total weight 
would be exactly 1; 

2. a dispatching function expressing how the weights of forbidden valuations are dispatched 
among solutions to insure that counting weighted solutions will yield at least 1 for any satis- 
fiable instance. 

We will refer to this method as solution weighting or for short weighting. 

In theorem[T5J we show that the estimation of satisfiability used by Maneva and Sinclair [1] can 
be improved upon by using a weighting scheme based on a 3- valued CSP and obeying the conditions 
of our Weight Conservation Theorem (which shows that these conditions are somehow relevant). 
Thanks to this weighting system, we improve on the upper bound on the existence of non-trivial 
cores to 4.419. We completely reuse the proof of Maneva and Sinclair [l| for our new weighting 
system, showing that the improvement on the value of the bound is indeed due to a better weighting 
system. 

Till now the only way to compare ordering and weighting was to compute the estimations of 
satisfiability obtained by each of them on a certain set of instances and to choose the best one. We 
give some results comparing these two ways of estimating satisfiability in the following cases: 



• 



weighting and ordering can be instance dependent when such syntactic properties as the 
number of occurrences of variables and values etc. can guide the design of weighting functions 
and orderings. We show that in the general case where the weighting function is instance 
dependent and when the weighting is homogeneous (i.e. when weighting seeds and dispatch- 
ing functions are equal), weighting cannot be better than a well chosen instance dependent 
ordering (theorem [ 



• in the case where ordering and weighting are instance independent (which is the case of 
problems where the values are indistinguishable like graph coloring for example) and in the 
case of sets of instances closed under value renaming (which is the case of almost all sets of 
instances considered in the literature), we show that weighting and ordering are equivalent 
on average ( theorem 155)1 . 

3. Framework 

A CSP (Constraint Satisfaction Problem) is a triple F — (X, D, C) where X is a set of variables 
taking their values in the same finite domain D of values, and C is a set of constraints. A constraint 
is a couple (jc, R) where x G X k and R C D k for some integer k. R is interpreted as the tuples of 
allowed values. A valuation is a vector v G D x ; access to coordinate x£lof« will be denoted as 
v(x). It satisfies some constraint ((xi,X2,...,Xk),R) iff (v (xi) ,v (X2) , ■■■■,v (%k)) G R. A valuation 
is said to be a solution of a CSP instance iff it satisfies all of its constraints. 



We consider some sets T of CSP instances sharing the same set X of variables and the same 
domain D. In the rest of the paper n — \X\ denotes the number of variables, d = \D\ the size of 
the domain. Given a CSP instance F, let S (F) denotes the set of its solutions. 

We are interested in the neighborhood of valuations. Given a valuation v and a s D, we define 
v x <- a as the valuation obtained from v by changing the value of i to a (including the case when 
already a — v (x)). Given a variable x, two solutions are called x-adjacent if they agree on all 
variables but x: in other words a and r are x-adjacent iff r = o~ x< _ T t x y Note that for each variable 
x, x-adjacency is an equivalence relation on solutions. Bringing together the x-adjacency relations 
with respect to every variable and removing the loops (a, a) we get an non-oriented graph on S (F) 
that we call solutions network. Let Np (a, x) denote the equivalence class of a under x-adjacency 
(i.e. the neighborhood of a for variable x); note that Np (a, x) is a clique for x-adjacency. Such a 
clique will play a central role in our weighting system. We are also interested in the different values 
that x takes in this equivalence class, so we define Ap {a, x) = {r (x)} reJV i ax \- For example in 
figure [21 solutions ab and aa are y-adjacent, Np (ab, y) — {ab, aa} and Ap (ab, y) — {b, a}. 

Most of the results in this paper apply to any set of solutions, regardless of which CSP instance 
has generated them. The sole solutions network can be thought of as the input of the problem. 
However it should be borne in mind that weightings and orderings cannot be defined using the 
global knowledge of the whole set of solutions, because of the locality condition: one can only 
count instances having a given solution, and for each instance the solutions that are neighbors of 
this solution (rather than all solutions of a given instance). A convenient way to visualize this 
limitation is to imagine a network of processors (a processor representing a solution) where each 
processor has knowledge of its neighbors only and must compute from this knowledge its own weight 
or determine the orientation with respect to its neighbors. 

4. Weighting of Solutions 

First we define a weighting system for all valuations (solutions or not) which sums up to 1. Then 
we give sufficient conditions on a weighting system on solutions only, such that a transfer between 
this weighting system and the previous one may be possible. Doing this we establish a general 
framework for putting weights onto solutions, and use it to derive two particular weighting systems: 
the first one addresses general CSPs and the second onejs built to improve on the weighting system 
introduced by Maneva et al. [15| . Maneva and Sinclair [I], Ardila and Maneva [17|. The purpose of 
such a transfer is to estimate the global weight in the weighting system on solutions by means of 
the global weight of the weighting system on all valuations (which is easier to compute) . 

4-1- Weighting Seeds 

Definition 1. For a CSP F = (X,D,C) a weighting seed is a function sp : X x D — >• R + . We 
say that sp is unitary iff Vx G X, ~^2 aeD sp (x, a) = 1. 

Now we define the unladen weight of any valuation v (solution or not) with respect to some 
weighting seed ap as : 

U F {v)= Y[sp(x,v(x)) . (1) 

xGX 

As for the actual weight of a solution, we want to take into account the neighborhood of the 
solution, so we put the weight wf {a, x) on each variable x of solution a. We will see later how to 
build wp from sp. 
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The actual weight of a solution is: 

Wf {a) = H w F (a, x) . (2) 

xeX 

By extension, the weight of a set S of solutions is: 

W F (S) = J2W F (a) . (3) 

<J£S 

Lemma 2. If the weighting seed Sp is unitary, then the total unladen weight of all valuations is 1: 
J2veD^ u F(v) = !■ 



Proof. 



J2 u F (v) = y, U **■&*{*)) 

veD x veD x xex 



n j2 s F( x > a ) 

xex ae 



xex aeD 



xex 
= 1 . 

□ 

This weight U F is indeed simple to handle. The purpose is now to connect it with W F . Just as 
we defined weights W F of solutions in a product form variable per variable, so shall we build our 
transfer system. 



4-2. Decomposers 

Definition 3. We say that w F is decomposable by a family (#F>,x,a) iff f° r au solution a of F and 
all variable x, w F (o~,x) — ^2 a&F >^F,a;x,a- Such a family will be referred to as a decomposer. We 
define onto it the following transfer quantities between a solution a and a valuation v. 



{_[ 8f,it,x,v(x) ■ (4) 



xex 



Lemma 4. (Transfer lemma). Let F be a CSP instance and a any of its solutions. If w F is 
decomposable by family (S Fi(7 . x ,a), then 

W F (a) = J2 Tf ^ ■ ( 5 ) 

veD x 



4.3 Weight Conservation Theorem 6 

Proof. It is sufficient to expand the weight of a solution as follows: 

W F 0) = \\w F (a, x) 

xGX 

= \ J 2^ ^F,a,x,a 
x£X a£D 

= 2^ | ]_ &Fm,x,v(x) 

veD x x£X 

= 2_^ Tf,ct^v ■ 
v£D x 

□ 

We want to insure that transfers made towards a valuation are at least its unladen weight, hence 
we define the following property of covering. 

Definition 5. Let S be a subset of S (F); we say that (Tf, S) covers Up iff Vu € D x , J2 a es Tf,o-->v > 
U F (v). 

We can now state some general conditions that are sufficient for a weighting scheme to be correct. 

4-3. Weight Conservation Theorem 

Theorem 6. (Weight Conservation Theorem) . If the following assumptions hold: 

1. the weighting seed Sf is unitary, 

2. the actual weight wf is decomposable by family (Sp,a,x,a), 

3. (Tf,S) covers Uf, 

then W F (S) > 1. 

Proof. Since wf is decomposable by family (Sp,a,x.a), lemma 3] asserts that \/a G S, Wf (a) = 

T, v eD xT F^^v- Thus 

W F (S) = Y^ W f ( a ) 

= 2_. /_. Tp.a->-v by lemma 2] 

treSv£D x 

= Yl z2 t f,^v 

v£D x o-eS 

> 2_] Uf{v) since (Tp,S) covers Uf ■ 

vGD x 

Moreover by lemma[2l since sf is unitary, ^2 veD x Uf (v) = 1- d 

Thus we have exhibited three sufficient conditions to get a weight conservation theorem. These 
conditions might not be necessary; however not any weighting system wf will be correct, as shown 



in example on figure 1(a) So let us introduce a way to build wf from sf in a way that is intended 



to match the conditions of our Weight Conservation Theorem. 



4.4 Generators 



4-4- Generators 

All weights we put onto solutions (either in section 14.51 or in section [3]) are built from a weight 
generator, as follows. 

Definition 7. A generator is a function uip : IxflxP (D) — > R + . We say that wp is unitary 
iff for all variable x and all nonempty subset A of D, J2 a eA ^f ( x ' a ' ^) = ■'■• 

From the weight generator u/j? we now define the actual weight uif of a variable in a solution: 



Wf (<T, x) 



ojf (a;, cr (#) , ^4f (c, #)) 



(6) 



Remark. If <r and r are 2 solutions such that o~ {x) = t (x) and Ap (a, x) — Ap(r,x), then 
Wf (c, x) = wf (t, a;). This is what Boufkhad and Hugel |ll| call a uniform weighting. 

This may suggest that it could be sufficient to put any weights such that the sum of weights on 



any clique would be 1; but it is not the case (cf. example on figure 1(a) ) 



4-5. Dispatchers 

Definition 8. A dispatcher is a function oIf '■ X x D — > R^ . 

Using the weighting seed sf and the dispatcher oIf we now build the weight generator cup of 
variables in a solution. Each variable will keep its seed sp; moreover the weights of forbidden values 
will be dispatched to allowed values thanks to dp, in the following way: 



wf( ^ a) = Uf (x, a) + E ^;t,6) E beD \A sf (x, b) ifaGA; 
I otherwise 



(7) 



A represents a category of set of allowed values; so the dispatcher dp dispatches the total 
weighting seed of forbidden values among allowed values. 

Fact. If sp is unitary, so is cup. 

Definition 9. We say that the weighting system is homogeneous when dp = sp. In this noticeable 
case the same function is used to assign a weighting seed and to dispatch remaining weights among 
neighbors. 



Examples of Weightings. As one can see in figure 1(a) even if we put a total weight of 1 on each 
clique, the overall weight can be less than 1 . To prevent such bad configurations we let our weights 



take the form of seeds+dispatchers (figures 1 (b) and 1 (c) I . The purpose of building weights from 



seeds and dispatchers is to prevent the same kind of inversions that we encountered for orientations 



(which led to circuits): in figure 1(a) in clique {a, b} for variable x, a is given a much smaller weight 



than b, whereas in clique {a, b, c} for the same variable x, the opposite occurs. In fact dispatchers 
allow some reshuffling of weights between different cliques of the same variable (see the weights 



given to values a and b in cliques {a, b} and {a,b,c} for variable x on figure 1(c) I, but the fact 



that seeds and dispatchers are assigned to each individual couple (variable, value) enables a kind of 
consistency between a clique and its sub-cliques, preventing circuit-like structures from appearing. 



s F 



• figure 1(b) was obtained by the following choice of sf and dp (homogeneous case, so dF = sf): 
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(a) A bad weighting system. 
The total weight is 2*(0.1*0.9+ 
0.1*0.7 + 0.2) = 0.72. 



(b) An homogeneous weighting 
system. The total weight is 
0.7 + 0.3 + 0.2*0.4 + 0.1*0.571 + 
0.3*0.667+0.333*0.429 ~ 1.48. 



(c) An heterogeneous weighting 
system. The total weight is 
0.7 + 0.3 + 0.2*0.4 + 0.1*0.486 + 
0.3*0.433 + 0.567*0.514 ~ 1.55. 



Figure 1: Some basic examples of weights. Notations are the same as in figure [2] 
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• figure 1(c) was obtained by the following choice of sp and dp: 
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We come back to our weighting system wf built from sf and cLf and show that it may be used 
to estimate satisfiability if sf is unitary. So our first result concerning this weighting system states 
that this system is correct for the estimation of satisfiability (theorem [j"2l below) . To prove it, we 
use our Weight Conservation Theorem, using the following decomposers: 



Sf (x, a) if a (x) = a; 

^ {xM t( x ^ F(x,a) i{a£A F (a,x); 

otherwise . 







(8) 



We must now prove that the conditions of our Weight Conservation Theorem are satisfied: wp 
is decomposable family {8F,a,x.a) and (Tp,g) covers Uf- 

Lemma 10. wf is decomposable by family (<5f,<7,x,o)- 

Proof. By definitions: 



aeD 



dp (x,a (x)) 



^ / d F (x,< 
2_^ I s F (x, a) l a=CT ( x ) + =^ 

aeD \ l~,aGA F (a : x 



dp (x,a) 



sf (x,a) l a <£A F (o,x) 



= s F (x,a (x)) + 
= wf {a,x) . 



d F (x,a(x)) 



, . E s F(x,a) 

^a £ A F (a, X ) a F(X,a) ^ Af ^ x) 



a 



4.5 Dispatchers 



As the unladen weight of a valuation is scattered among lots of solutions, in the proof of the 
following lemma we use an algorithm building a tree in order to catch enough solutions to insure 
the covering condition. The proof is somewhat technical and may be skipped at first reading. 

Lemma 11. Let g be any connected component of the solutions networks (F). Then (Tp,g) covers 
U F . 

Proof. First we need some definitions. A partial valuation r\ over Y C X is a function from Y to the 
set D. The domain of 77 is Dom (77) = Y. The level (of undetermination) of r\ is Level (77) = |X\Y|. 
Let Z C Y C X, let 1 be a partial valuation over Z and 77 be a partial valuation over Y . Since 
Z Q Y, we say that 1 <Dom V- Of course <Dom is a partial order relation. We say that 77 is an 
extension of 1 iff Vz G Z 1 rj(z) = o(z), in which case we also say that 1 is the restriction of 77 to Z: 
1 = n z . In the particular case when Y = Z U {x} with x (jz Z , we denote by t Ih>(11 the extension 
of l to Y assigning value a to x. Let g be a connected component of the solutions network. Note 
that the empty valuation e (with domain 0) is extensible to a solution in g as soon as g ^ 0. Given 
a partial valuation 77, we call E g (rf) the set of its extensions which are elements of g and r g (77) the 
set of restrictions of 77 extensible to a solution in g (i.e. restrictions r of 77 such that E g (r) ^ 0). 

Let us take any valuation v. We must prove that J2<reo Tf.o-^v > Up {v). Since g 7^ 0, e e r 9 (y) 
so r 9 (w) ^ and we can pick an element vq in r g (v) maximal with respect to the order <Dom- 
We arbitrarily put indices 1 . . .no onto the remaining 77,0 = Level (vq) variables: x\, . . . ,i„ (i.e. 
variables not set by vq). In the following algorithm we shall bind a fictitious weight f (rf) and a 
solution t (77) to a partial valuation 77. At the beginning / (vo) — Uf (v), and we make a call of 
Extend (vq). 

Algorithm 1 Extensions of a partial valuation. 



procedure Extend^) 
i <r- Level (77) 
if i = then 

S ^Sl){r]} 
else 

r (77) <— a solution maximizing ^2 beA < a x .\ dp (xi, b) among a S E g (rf) 
for all a G Ap (r (77) , xf) do 

/ (Vx^a) i~ Y— " ".d F (xi.b)f (*?) 

9: Extend (rj x ^ a ) 

Informally we are building a tree and propagating weights from the root v$ (at level no) to leafs 
which are solutions (at level 0) in a conservative way: the total fictitious weight on level i will be 
the same as that of level i + 1 . 

Formally, what can we insure along this process? 

1. The first thing to notice is that the algorithm stops; namely the nested calls of Extend (77) 
decrement Level (77) till it reaches 0. 

2. Secondly S is indeed a set of solutions in g extending vq. Namely at each call of Extend (77), 
77 is extensible to a solution in g and the set of unset variables of 77 is {xi, . . . ,Xi\, where 
i = Level (77). Thus when i — 0, 77 is a solution in g. We prove this by induction: 

(a) at the beginning: vq £ r g (v), vq trivially extends itself, E g (vo) 7^ and the set of unset 
variables of vq is {x\, . . . , x no }; 
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(b) now suppose that E g (77) ^ 0, 77 extends vq and the unset variables of 77 are {xi, . . . , x^}; 
given r (77) G Eg (77), let a £ A F (r (77) , x$); then the valuation r {f]) x . i _ a is a solution by 
definition of A F (r (77) , a;,); moreover it is connected to r (77), thus r (?7)_. , is an element 
of component <?. Moreover since r (77) is an extension of 77 and x, is unset in 77, r (77) 2 ,. i _ Q 
is an extension of r\ Xi ^ a - Thus r (77)^^ € -E ff (r7 XjM . ), so £ ff (r7 XiM . ) ^ 0. Of course, 
r\ Xi ^ a extends vq, the unset variables of rj Xi ^ a are {xi, . . . , Xi_i} and Level {n Xi ^ a ) = 
Level (77) — 1 = 7 — 1. 

3. X)o-e5 f ( CT ) = ^ ( w )' namer y among partial valuations considered in the process, 77 £ S iff 
Level (7/) = 0. Moreover we now prove by induction that X)lcvc10))=j / (v) = Uf (v): 

(a) at the beginning when i — no, the only partial valuation of level no is vq and / (vq) — 
U F (v); 

(b) now suppose that X^LcvcKjiW / iv) = Uf (v); in our process each partial valuation 77 of 
level i — 1 has one and only one parent in level i, which is given by the restriction 7/ of 
77 to Dom (v ) U {x l+ i , . . . , x„ }; thus 

£/(*) = £ £ /bUO 

Lcvcl(?^)— i— 1 Level(77')— i a£A^ (r (77'), ais) 

sr v d F ( Xl ,a) 

= E /M 

Lcvcl(r/) = i 

= c/fW ■ 

4. V(T € iS, Vz G {1, . . . ,no} , i> (xi) ^ Ajr (cr, Xj). Suppose on the contrary that 3a £ S,3i £ 
{1, . . . , n } , v (xj) G Ajr (cr, x,); the partial valuation fo^M-iife) is still a restriction of v; 
moreover, since by item[3J a is an extension of vq, u Xi4 _ v i Xi \ is an extension of v 0xih ^ v r Xi y, and 
since v (xj) G A F (<7,Xj), u Xl ^ v ( Xl ) is a solution. Thus Vo^h-H,^) G r fl (v) andv 0x< ^. v ( x< ) >Dom 
tjo, contradicting the maximality of vq in r g (v). 

5. Vcr £ S, f (a) < TF,a-^v] namely, let us take any a £ S: 

T F ,o^v = JJ Sf,<t,x,v(x) by definition [3] 

sex 

n/ dj7 (x, cr (x)) \ __ 

s F (ar, u (x)) 1„( X ) =CT ( X ) + =^ -— rl^x)^^,*) b Y eq. 121 
ie i V 2^aeA F (^x) d F{x,a) J 

ni r s,j\ d F (x l ,(j(x l ))s F (xi,v(xi)) ™ ,n, 

sf{x,v(x))\\ — =^ — r — by item [2] and m 

= U F (v)f[ = *Z<*ll&» by definition ffl. 
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Moreover note that 



no flu r "i | 

, , s f , MT V \Oom(yo)U{xi,...,x no ^J 

/ \?) = J (.^|Dom(oo)J 11 ' 



1=1 / ( Cr |Dom( l) o)u{2: l+ i,...,a;„ }J 



/(. 



|Dom(-Uo)U-[a;£-fi,...,2i no ka^i— »(x(a;.;) 



»=1 / ^ (T |Doni('!)o)u{x i+ i,...,x„ } 



- E «^(H' l - 1 *(, + ,..,„ ))^-) df(I " ffl) ' 



Since of course, for all i between 1 and nn, a £ E„ I o~ lr , , >, ,r i ), by choice of 

t (77) in line [6] of algorithm [1] we have that ^ A / / \ \ d^ (xj,a) > 

EaeAHo-.x,) d f (^> a )' whence /(a) < U F («) ELSi g^^flfa) = T *>->«- 
Thus we finally get that 

X! T *>^ ^ H / ( ct ) h y item El 

> TJ / (cr) because S C g . 

Moreover, by item[31 J^aeS f ( <J ) = ^ F ( w )' * nus (Tf, 9) covers Uf- d 

From lemmas [TOl and ITT ] we conclude that our weighting system built from seeds and dispatchers 
obeys the conditions of the Weight Conservation Theorem. 

Theorem 12. Let F be a satisfiable CSP instance and g the solutions in a connected component of 
the solutions network of F. Weights wf are built from seeds sf and dispatchers dp, as in definition 
[3 If the weighting seed sf is unitary, then Wf (5) > 1. 

Remark. In this paper we do not address the question of choosing the best sf and dp for a given 
instance F or for a given family of instances, which must be custom-tailored depending on the 
considered problem. 

5. Boolean Case: a Better Upper Bound on the Existence of Non- Trivial Cores 

5.1. A Better Weighting for Partial Valuations 

In order to estimate boolean satisfiability of formulas, Maneva and Sinclair [1| use a so called 
Weight Preservation Theorem. Valuations here are mappings from X to D = {0, 1, *}. The value * 
is to be interpreted as or 1. They call a valuation valid iff each clause contains at least one true 
literal or two starred literals. In this section, one has to be aware about the fact that we define a 
boolean solution as a valid valuation taking its values in {0, 1} only! <S (F) still denotes the set of 
valid valuations of an instance F (with values in {0, 1, *}) and Ap (c, x) still refers to neighborhood 
in S (F) . Note that any formula has at least one valid valuation: the one that gives the value * to 
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every variable (the so-called trivial core), so the existence of valid valuations does not guarantee 
the existence of boolean solutions. Nevertheless, counting weighted valid valuations can be used to 
estimate boolean satisfiability. 

Maneva and Sinclair choose their weights as follows: each variable has a weighting seed So (x), 
s* (x) such that so (x) + s* (x) = 1, and for all valid valuation a and all variable x they put the 
following weight: 

if a (x) = * ; 
qF (cr, x) -= ^ so (x) if a (x) ^ * and * £ Af (cr, x) ; (9) 

otherwise . 

As shown by Maneva et al. [15], the sum of the weights of all valid valuations reachable from 
any boolean solution is exactly 1. The reachability property is defined as the existence of a path 
from the boolean solution to the valid valuation where at each step a variable is given the value * 
while maintaining the validity property. Since a given valid valuation may be reachable from lots of 
different boolean solutions (but sometimes from no one), counting the weighted partial valuations 
hopefully enables to count less than the number of boolean solutions. 

Using these weights, Maneva and Sinclair [1] count the so called non-trivial cores a; a non-trivial 
core cr is a valid valuation with a linear number of non-starred and non-starrable variables (i.e. such 
that Ap (cr, x) = {cr (x)}). Many non-trivial cores are not extensible to solutions; a core is extensible 
to a solution when there is a boolean valuation of the starred literals which is a boolean solution. 
They manage to count only cores which are extensible to a boolean solution, and they estimate the 
satisfiability of the starred part of the formula by weighting valid assignments as defined in equation 
O In this section we define our new weights and show that they are correct, and in section I5T21 we 
use them to improve on Maneva & Sinclair's upper bound, from 4.453 to 4.419. 

Before we give this improvement and show its correctness, we want to stress an important 
difference between the weighting of solutions of general CSPs as defined in the previous sections 
and the weighting defined in this section: in the previous sections, an unsatisfiable formula has 
always a total weight of while in the present one, an unsatisfiable formula (a formula with no 
boolean solution) will have a non-zero total weight (provided the weights of the value * are not 0). 
This is the price one has to pay to lower the weights of satisfiable formulas. This fact makes difficult 
to establish a general comparison between both methods, because they are highly dependent on the 
set of instances that are considered and in particular on the proportion of unsatisfiable instances 
among them. 

To improve on Maneva et al.'s estimation system, we choose the following weights: each variable 
x has a unitary weighting seed sf (x, 0), sf (x, 1) and sp (x, *)• From this seed sf we define the 
weight generator up as follows: 



Sf (x, a) if a = * and a £ A ; 

luf (x,a,A) = i sf (x,a) + ^2 beD -, A SF (x,b) if a ^ * and a £ A ; (10) 

if a <£ A . 

As before in section T4. 51 we define the actual weight wp (cr, x) = ujp {x, o ix) , Ap (a, x)). 



5.1 A Better Weighting for Partial Valuations 13 



Remark 13. Noticeable values of u>f'- 

lof (x,0, {0}) =uf(x, 1,{1}) = sf (x, 0) + sf (x, 1) + sf (x, *) = 1 ; 

wf(z,0, {0,*}) =u>f(x, 1,{1,*}) = s F (x, 0) + s F (x, 1) ; 

Wi7(i,a,{0,l,*}) = sf(o;,o) ; 

Wf (i,*,A) = sf (x, *) if * G A . 

Remark 14. cjf is almost unitary, since for all nonempty A C D, A / {*}, A ^ {0, 1}, 

SaGA w l 2 "' a > ^) = 1' (0) 1} cannot be a clique in this model of validity, because if both and 1 

are allowed, so is *. However {*} can be a clique, and in this case u> (x, *, {*}) = sp (x, *). 

Our system can be seen as a split of 1 — s* (a;) into sp (x, 0) and sf (x, 1) (instead of just Sq (x) 
for Maneva) in the case when a (x) ^ * and Af (tr, x) = {0, 1, *}; thus our weights are smaller than 
Maneva's, though we are able to insure that they are correct. 

This system is different from the system seeds+dispatchers, because here a fixed variable at 
value * is given a weight of sp (x, *), whereas dispatchers would give it a weight of 1. However we 
are able to use our Weight Conservation Theorem, using the following decomposers: 

, , ., \<r(x) =a 
sp [x, a) it < ; 

I or (cr (x) 7^ * and a $_ Ap (a, x)) (11) 

otherwise . 

We must now prove that the conditions of our Weight Conservation Theorem are satisfied: wp 
is decomposable family (SF y a-,x,a) and (Tp,g) covers Uf- 

Lemma 15. wp is decomposable by family {8f,<j,x,o)- 

Proof. By definitions: 

1. if a (x) = *: YsaeD $F,<r,x,a = YsaeD S F ( x , a ) 1 a=a(x) = S F {x, a (x)) = Wp (a, x)\ 

2. if a (x) ^ *: 

2_^^F,a.x.a = 2 , SF ( X > °) ( 1 n=g'(a) + lg<£A F (cr,x)) 
aeD aeD 

= s F (x, a (x)) + YJ SF{x,b) 

b£A F (a,x) 
= Wp (a, x) . 

a 

Lemma 16. Let v be a valuation and g be any connected component of the network of valid val- 
uations S (F) containing a boolean solution. Then there exists a valid valuation a G g such that 
U F (v) = Tf i0 -_>.„. 

Proof. Let us take any v G D x and a boolean solution o-q G g. At the beginning we put a = o~q. 
Consider the following procedure: 

• If there is a variable x G X such that a (x) ^ v (x) and o~ x -t^v(x) remains a valid valuation, 
then change a to o~ x <- v ( x ). 
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We iterate this procedure till there is no variable x € X such that a (x) ^ v (x) and ct x ^ v ( x ) remains 
a valid valuation. This eventually happens because at each step we make a move towards v, and X is 
finite. So in the end, each variable in a has either its initial boolean value in oo or the value given by 
v. In other words, for any x € X, either a (x) = v (x) or (a (x) ^ * and v (x) £ Ap (a, x)). Thus by 
equation llH 5p axv r x \ = sf (x,v (x)), which in turn by definitions Q] and [3] yields Tp,a^v = Up (v). 
Moreover, by construction, the ending a is also in g. □ 

Corollary 17. Let g be any connected component of the network of valid valuations S (F) contain- 
ing a boolean solution. Then (Tp,g) covers Up. 

Thus our weighting system obeys the Weight Conservation Theorem, and we can conclude that 
7 (F) = W F {S (F)) > W F (g) > 1 and state the following theorem: 

Theorem 18. wp as defined in equation{TQ yields 7 (F) > 1 whenever F admits a boolean solution. 

Remark. We cannot apply theorem [35] because there is no closure by renaming; namely {0, 1} 
cannot be a clique (if both and 1 are allowed, so is *), whereas {0, *} and {1, *} can. 

Note that in the particular case where for all x G X , sf (x, 0) = 0, sf (x, 1) = 1 and sp (x, *) = 0, 
we count what Dubois and Boufkhad [5| call Negatively Prime Solutions (NPSs). Moreover, as 
soon as sf (x, *) = 0, this weighting can be seen as seeds+dispatchers on a boolean domain (so this 
weighting is homogeneous). 

We used the weighting defined in equation [TU] to compute an upper bound of the threshold 
of random 3-SAT: taking seeds independent of F and x, we obtained the best estimation when 
Sf (x, *) = (and the corresponding upper bound is 4.643, just like with NPSs). We conjecture 
that even if one takes seeds dependent on F or x, the best choice of sf (x,*) to estimate boolean 
unsatisfiability remains indeed 0. The reason why we think so, is that, as described in remark [T4l 
u>f is almost unitary, except for the clique {*}, in which case u>f (x, *, {*}) = sp (x, *). 

5.2. Application: Non-Existence of Non-Trivial Cores in 3-SAT 

We apply here the weighting defined in 15.11 to improve on the upper bound on the existence 
of non-trivial cores in 3-SAT shown by Maneva and Sinclair [l|. First, we recall the basic notions 
defined by Maneva and Sinclair [1| and we reformulate them according to our notations. Starting 
from solutions (which are also valid partial assignments), the following process is iterated: when- 
ever there is a starrable variable x (i.e. a variable such that under the current valuation a, we 
have o~ (x) ^ * and A(a,x) contains *), consider for the next step the valuation obtained from 
a by assigning * to i. This process stops when no such variables exist, and the resulting partial 
assignment is called a core. Only the cores a for which the set of non-starred variables is not empty 
are interesting; they are said to be non-trivial. Such cores contain an important information on the 
geometry of the space of solutions and underlie the so-called clustering that explains the difficulty 
of solving such instances. 

To study the existence of cores, Maneva and Sinclair [l| make use of covers. A cover is a partial 
valuation where every non-starred variable is non-starrable. Obviously, each core is also a cover but 
the converse is not true. Indeed, by construction, for a core there exist always a way of assigning the 
starred variables so that the formula is satisfied, while for covers no such condition is guaranteed. 
The size of a core or a cover is the number of non-starred variables. 

Maneva and Sinclair [1] compute their upper bound in two steps: 
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1. First, they use the first moment method to upper-bound the probability of existence of covers 
of certain sizes. This allows them to discard the ranges of sizes where covers do not exist, and 
within these ranges cores cannot exist either; 

2. As for the ranges that are not discarded by the previous step, the probability that covers 
can be extended to solutions is upper-bounded using the first moment method through the 
weighting system. 

Implementing this method as described in Maneva and Sinclair [l| we obtain the following improve- 
ment. 

Theorem 19. Random instances of 3-SAT with density greater than 4.419 have no non-trivial 
cores with high probability. 

As we mimic Maneva and Sinclair [l]'s proof, we defer it to the appendix. Let us just mention 
the weights we use : sp (x, *) = p, sf (x, 1) = sf (x, 0) = (1 — p) /2. We determined the best values 
for p by numerical simulations, and it depends on the parameter a (size of cores). By symmetry, 
any combination of values given to the sf (x, 1) and sf (ar, 0) summing up to 1 — p will give the 
same result. 

One can wonder whether this bound can be improved upon using boolean solutions and ordering. 
The answer is no since that case is a particular case of our weighting system (eq. 1 1 0[) : the case where 
sf {x, *) = 0, sf [x, 1) = 1 and sf [x, 0) = or vice-versa. It is easy to answer this question in the 
boolean case because any ordering can be seen as a particular weighting (by choosing the weight 1 
for a value and for the other one). But in the general case of larger domain CSPs the answer is 
not so easy. In the following section, we give general comparisons between orderings and weightings 
in different general cases. 

6. Weighting versus Ordering 

6.1. Partial Ordering of Solutions 

Given a CSP instance F, various partial orders -<p can be defined on the set of solutions such 
that for every two adjacent solutions a and r of F, we have either a <f t or t -<p o~. The aim of 
the partial order here is to provide a measure on the solutions network through the number of its 
minimal elements. Let M.^ F (F) be the set of minimal solutions of F with respect to the order -<jr. 

In the solutions network of F, a partial order -<p can be seen as a circuit-free orientation of 
the edges of the graph such that an edge goes from r to a iff a <p t; then minimal elements are 
vertices with no outgoing edges. In general one seeks partial orderings that have the least number 
of minimal elements; however the choice is limited because orderings must be chosen according to 
local criteria only. 

Construction of an Ordering. 

Definition 20. Given a variable x G X, a total strict order <f,x on D gives an orientation between 
neighboring solutions: a -4f x t iff ct and r are re-adjacent and a (x) <f,x t (x). Note that -4p x is 
a partial strict order on the set of solutions, but a total strict order in each clique Np (c, x). 

We can bring all partial orders -<f. x together on the set of solutions, as follows: if a and r are 
x-adjacent and different, then a -<f t iff a <f,x t. This is possible because two different solutions 
a and t cannot be both x-adjacent and y-adjacent for two different variables x and y. We say that 
-<f is the orientation on S (F) induced by the set {(x, <f,x)} eX ■ 
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Lemma 21. If -<p is the orientation on S (F) induced by a set {(x, <F,x)} xeX > then -Kp is circuit- 
free. 

Proof. Suppose on the contrary that there exists a circuit o~i ~<f ■ • • ^f o~i -<p ci for some I > 2. 
Let us consider the variable x such that o\ ~<f,x o~%. For any i < I, either o~i (x) = o~i+\ (x) (if 
o~i and <Ji+\ are not z-adjacent) or o~i (x) <f,x cn+i (x) (if o~i and c^+i are a;-adjacent). Thus 
o~\ (x) <f,x &2 (x) and o~2 (x) <f,x oz (x) <f.x ■ ■ • <f,x o~i (x) <f,x 01 (x): a contradiction. D 

Corollary 22. The transitive closure of ^p is a strict order relation. 

Instance Dependent or not. 

• Instance dependent ordering. In this case, we put for each variable x G X and each CSP 
instance F a total order <p^ x onto the domain D of possible values. As mentioned above, 
we (partially) order solutions as follows: let a G S (F) and r £ Np (a, x); we have a -<p r 
if and only if a (x) <f. x t (x). The motivation for the instance dependent ordering is that 
some syntactic properties of the CSP instance F can be exploited to define a suitable order 
for that instance. 

• Instance independent ordering. This is a particular case of the above ordering, when the total 
order < x on D does not depend on F. For some problems, no preferred order can be defined 
given some instance. This happens in particular when values are indistinguishable because of 
the symmetry of the problem (e.g. colors in graph coloring). 



Examples of Orientations. We first give an example (figure 2(a) I of an orientation which is not 



circuit-free, even though it was built from the following local orderings on each individual clique: 

• in cliques {a, b} for variables x and y, we have b < a; 

• in cliques {a, 6, c} for variables x and y, we have a < c < b. 

The problem comes from the fact that a and b are ordered differently in clique {a, b, c} and its 
sub-clique {a, 6}, which led us to consider only orientations built in the following way: we choose 
for each variable x a total order < x on the domain D and use it for each sub-clique of D. This is 



what Boufkhad and Hugel [11] call a uniform orientation. Example in figure 2(b) was obtained by 
the following orders: c < x b < x a and c < y a < y b. This orientation is circuit-free and has two 
minimal elements. Now among good orientations, the less minimal elements they have, the better 



they are; figure 2(c) which was obtained by the following orders: c < x b < x a and a < y c < y b 



gives an example of an orientation with just one minimal element. 

6.2. Homogeneous Case: Weighting Is Not Better than Ordering 
As we have seen, the weighting is based upon two functions: 

1. the weighting seed sf that determines the intrinsic weight of each value and then allows to 
compute the intrinsic unladen weight of each valuation; 

2. the dispatcher dp that represents how the weights of forbidden valuations are scattered among 
the authorized ones. 



6.2 Homogeneous Case: Weighting Is Not Better than Ordering 
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(a) A bad orientation. There is 
no minimal element under the 
orientation. 




(b) A good orientation. There 
are some minimal elements un- 
der the orientation (in gray). 




(c) A very good orientation. 
There is only 1 minimal element 
under the orientation. 



Figure 2: Some basic examples of orientations. We consider a network of 6 solutions over the domain D = {a, b, c} for 
variables X = {x, y}; by shortcut ab we mean that variable x takes value a and variable y takes value b. {aa, ba, ca} 
is a clique for variable x. 



A natural case to investigate is when these two quantities are equal, namely when each allowed value 
is dispatched a complimentary weight proportional to its intrinsic weight. So we deal here with the 
homogeneous case cIf = sf and show that whatever sf may be, there will exist an ordering which 
is at least as good as the weighting system, as will be stated in theorem 1321 The proof consists in 
choosing variable per variable the order <f,x in a way that does not increase the global weight. For 
our recurrence to work we use the homogeneity property. Just as we defined a generator up for a 
weight wfi so need we now to define a generator /ip for an orientation mp. 



Definition 23. We define the following binary weight function: 



fi F (x,a,A) = 



1 if a is the minimum of A for <f,s 
otherwise . 



(12) 



rriF (<r, x) 



Hf (x,<r(x) ,A F (o-,x)) 



(13) 



At each step of the recurrence, some variables are ordered while the other ones are weighted. 
That leads us to introduce the following definitions. We are going to substitute binary weights 
ttif's to original weights wp's variable per variable, so we call S the set of couples of (variables x, 
orders <f.x) where tof's are used and we define 



Definition 24. 



fiir(cr,S) = JQ m F (cr, x) J| W F ((7,x) 

xGX\B. 



zGE 



and we extend it to a set S of solutions by 

n F (s,E) = ^tt F (a,s) 



(14) 



(15) 



crGS 
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Remark 25. What happens when S is empty? 

n F (S(F),®) = W F (S(F)) . (16) 

Namely, by definition, for any solution a e S(F), tt F (a, 0) = Y[xex WF ( a ' x ) ~ W F (a). But 
ft F (S (F) , 0) = Z<reS(F) Vf (a, 0) and W F (S (F)) = E. e5 (F) W F (a). 

Remark 26. What happens when 5 is full? Suppose that for all variable x, < F . X is a total order on 
D. Let -<p be the orientation induced by {(x, <F,x)} xeX - Then 

n F (S{F),{(x,< FtX )} x&x ) = \M^ F (F)\ ■ (17) 

Namely, let us recall that for any solution a E S(F), Q F (a, {(x, <f,x)} xgX ) = Ilzex to f (c, x). 
Thus Vt F (a, {(x, <F,x)} xe x) = 1 iff Vx £ A", cr is the minimum of N F (a, x) for <p, x (or equivalently 
for -< F ); in other words a is minimal among all of its neighbors, which means that a is minimal (since 
-< F compares neighboring solutions only). Thus ftp ({(x, <F,x)} xeX ) ^ s ^ ne numDer of minimal 
elements of the underlying orientation -< F . 

We are now ready to state the main lemma in this section. 

Lemma 27. Suppose that s F is unitary and dp = s F . Then for each set S, each variable xq ^ 5, 
there exists a total order <p. Xo on D such that £l F (S (F) , 5 U {(xq, < f , Xo )}) < &f (<S (F) , 5). 

At first reading it might be convenient to jump directly to theorem [3H because the proof of 
lemma [57] is somewhat technical and requires some more notations and sub-lemmas. We fix a 
variable xo ^ 3. Let a be an element of D and A be a subset of D. We consider the preimages 
of (a, A) obtained through mapping a solution a of instance F to (a (xq) , A F (a, xq)). We denote 
these preimages as follows: 

S F)X0 (a,A) = {a e S(F), a (x )= a and A F (a,x )= A} . (18) 

Note that: 

1. when a $ A, Y, Fxo (a, A) = 0; 

2. the 'Sp.xa (a, A) are pairwise disjoint and |Jacd Sf.i { a > A) = 5 (F); 

aSA 

3. if <j, r 6 £f,£c (a, A), then iu^t (<j, xo) = w F (r, a^o) = u F (xq, a, A) 
and m^ (c, xo) = m^ (r, xo) = M_f (a?o, a, A); 

4. we call 

Zf.e.xo (a> A) = J^ |]mj?(ff, ar) [~] lUpfoa;) ; (19) 

cr6S FiX0 (a,A)a;eH igl\(Hu{i )) 

then by item[3l 

top (E F>XQ (a, A) ,5) = top (x ,a, A) • Z F>s , Xo (a, A) ; (20) 

Qir(E F]Xo (a,A),SU{(3;o,<F, IO )}) = Mf (xo,a, A) • Z F>s ,x (a, A) . (21) 
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We now need to explore further both quantities we want to compare. It will be convenient to use 
the following quantities: let E C D and a G E; we define the following quantities: 



t F ,s, X0 (a,E) = J2 Z ^, X0 (a,A) ; (22) 

ACE 
A3o 

Zf,b, X0 (E) = ^ ^ WF (xo, a ,A)-Z i ,, s , :Co (a,A) . (23) 

ACE a£A 

So what is the purpose of introducing these extra quantities? They will help us prove lemma 
|2"T1 through the following facts. 

Fact 28. If ax < Fxo a 2 <f,x • • • <F, X0 a d, then 

d 
n F (S(F),'»U{(xo,<F, xo )}) = ^2CF,s^ (a i ,D\{ai t ... t a i -i}) . 

Proof. We use the partition mentioned in item [2 



il F {S(F),Eli{{x ,< Fxo )}) = Q F |J S F , X0 (a, A), SU{(io,<F W )} 

\ ach 

VagA / 

= £ ft F (S F , X0 (a, A) ,H U {(s , <F,x )}) 

ACD 
a£A 

= J^ M-F (x , a, A) • Z FtStXo (a, A) by eq. [21] 

ACD 
a£A 

= ^2^2 VF ( x o, a, A) • Z F>3>Xo (a, A) 

oeDACD 
A3a 

= 2-^i 2-^i a is the minimum of A for <f,x ' F > 3 > x ° t 0,1 ' 

oGDACD 
A3a 

d 

= ^ ^ Zf,s,x (oj, A) since ai <_f, Ko • • • <f,x «d 

8=1 ACD\{oi r .,ai-i) 
Ago; 

d 

= /", Cf.h.xq (at, D\ {oi, • • • , Qj-i}) • 



! = 1 

a 



Fact 29. For a/? xo ^ 5, ^f (5 (F) , S) = &r )Hl x (£>)■ 
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Proof. We use again the partition mentioned in item [5J 



q f (S(f),e) = n F 



VaeA 



\_\ ^f,x (a, A) , 5 
ieA / 

£ r! F (S FiX0 (a,A),S) 

ACD 

^ u F (x , a, A) • Zp iS ,x (a, A) by eq. [2D] 

ACD 
aGA 

€f,H,xoP) • 



a 



Fact 30. IfEC-DjACEjCitzAjSp is unitary and d F = s F then 



ojp (x, a, A) y j dp (x, b) = dp (x,a) + u> F (x, a, A) y, d F (x,b) 

beE beE\A 



Proof. If a G A, then by equation top (x, a, A) J2beA ^f ( x > ^) = Sf ( x ' a ) SbeA ^■F ( x ' ^) + 
dp (x, a) X)fceLi\A SF (*> ^)- By equality aV = sf and the fact that s F is unitary, we get d F (x, a) = 
Wir(#,a,A)f] 66A d F (x,6). n 



Fact 31. Let x$ £ 3 and £" any nonempty subset of D. Suppose that s F is unitary and dp = sp. 
Then there exists b££ such that £f,h,£c (E) > C-F,h,k (a,E) + £f,e,x (^\{ a })- 



Proof. Let us call ag an element of E 1 minimizing CfExo (o,,E) + £f,e,xo (-^A {°}) when a E E: 
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£f,e.,x (E) ^ d F ( x o,b) = ^d F (x Q , b) ^ ^f {xo, a, A) Z F , s ,x (a, A) 

beE ACE 

aeA 

= ^ d F (x , a) Z F>S ,x ( a ; A) 

ACE 
aeA 

+ *^2 u> F (so, a, A) ^ d F (x , b) Z F ,3,x (a, A) by fact [30] 

ACE beE\A 

aeA 

= ^ dp ( x 0' a ) Z f.s.x (a, A) 



beE beE ACE 

aeA 



ACE 
aeA 



2J wf (xo,b, A) d F (x ,a) Z FS . X() (6, A) 



ACE 

aeE\A 

beA 



= ^d F (xq, a) ^2 z f,s,x (a, A) 



aeE ACE 

A3a 



+ ^2d F (x ,a) ^2 ^2u F (xo,b,A)Z FS , Xa (b,A) 

aeE ACE\{a}beA 

^2 d F {xo, a) (Cf,h,x ( a , E ) + €f,b,x (E\ {a})) 



aeE 



- ^2 d F ( x o, a) (Cf,s,x (ao, E) + £, F ~, Xo (E\ {a Q })) 



aeE 

That gives what we want since XbeE dp ( x o, b) ^ (by definition [51 dispatchers must be positive). 

□ 

Proof of Lemma H3 By fact EH f2_F (5(F) ,3) = £ f ,e, Xo (D). From D we successively remove 
what we call oi, Gt2> • • • > fl d till we reach the empty set; applying at each step fact [31] yields that 
£f,s,x (-D) > Eti Cf,h,x (Oi, D\ { 0l , . . . , Oi_i}) + 6r,E,x (0)- By definition, £ F:S , X0 (0) = 0. What 
order < F , X(I shall we choose on Dl 

Of course: a\ < Fxo a 2 < F , Xo ■ ■ ■ <f,x q a d - 

Then by factES fi F {S (F) , 5 U {(s , <f,x )}) = Eti Cf,3,x fa, D\ {oi, . . . , Oi_i}). So in the 
end n F (5 (F) ,S) > Q F (S (F) ,3 U {(x , <f,x )})- 

Theorem 32. For any instance F, any positive and unitary weighting seed s F , when d F — s F , 
there exists an instance dependent orientation -< F induced by a set {{x, <F.x)} x£X °f total orders 
on D, such that \M-< F (F)| < W F (S (F)). 

Proof. By remarkES] W F (S (F)) = Q F (S (F) , 0). Starting with 3 = 0, we add elements (x , < F , X0 ) 
to 3 such that £l F (S (F) , 5) > tt F (S (F) , 5 U {(x , < F , X0 )}), which is possible by lemma ETI At 
the end of the process we have thus £l F (S (F) , 0) > ftp (5 (F) , {(#, <F,ir)} : rex)- Let ^ F be the 
orientation on S (F) induced by {(x, <F.x)} xeX - 
By remarkEi ^f (S (*") > {(*, <*>)W) = \M <F (F)|. So W F (S (F)) > |A^ F (F)|. □ 
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Whether this theorem is true for heterogeneous weights remains an open question. 

Remark. In the particular case of boolean satisfiability (i.e. when D — {0, 1}), there is no choice 
on d F '- the weighting system is necessarily homogeneous. Thus in this case weighting is not better 
than ordering. 

6.3. Instance Independent Case: Ordering and Weighting Are Equivalent 
Definition 33. The weight of a CSP instance F is: 

7 (F) = W F (S (F)) . (24) 

By extension, the weight of a set T of CSP instances is: 

7CF)=X>( F ) • (25) 

A permutation over the domain of values is a bijection 7r : D — > D. A renaming of values is 
a family of permutations II = (tt x ) x£X over the domain D. For a CSP instance F, let II (F) be 
the instance where every occurrence of a value a for every variable x are replaced by n x (a). A 
set of CSP instances T is said to be closed under renaming if for any renaming II, if F 6 T then 
II (F) € T . By abuse of notation, for any valuation v, we denote by II (v) the valuation that assigns 
value tt x (v (x)) to variable x. 

Let us first give a very simple yet useful fact: 

Fact 34. Let II be a renaming, F and G be CSP instances. Then 

1. aeS(F) iff TI (a) 65 (11(F)); 

2. A JI{F) (TI(a),x)=TT x (A F (a,x)). 

Note that almost all sets of CSP instances we know to be dealt with in the literature are closed 
under renaming. 

Let T be some set of instances closed under renaming. We prove in the sequel that 7 (T) = 
Sfg-f \M-< (F)\ f° r an y instance independent orientation -< on solutions as defined in section f6. II 
That can be interpreted as follows: on average on J 7 , the weight of all solutions is equal to the 
number of minimal solutions, independently of the orientation -<. The proof idea is to partition the 
couples (solutions, instances) in a way that the weight of each class of the partition has a weight of 
1 and corresponds to a minimal element for -<. 

We define the set C of couples (a, F) where F is an element of T and a a solution of F: 



7 (T) can be written as 



C = {(a,F)} FeF . (26) 

aeS(F) 



7 (.70= E Wf ^) ■ ( 27 ) 

(a,F)eC 



For some variable x and some valuations V\ and V2, we define the permutation n x<VljV2 on D as 
the transposition which swaps V\ (x) and v 2 (x), and the renaming 11,^ „ 2 as the collection of these 
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permutations, variable per variable: 

[ V\ (x) if a = v 2 (x) 

(28) 





J Vi (x) if a = v 2 (x) 


'x,v 1 ,v 2 ( a ) 


= \ v 2 (x) if a = vi (x) ; 




1 a otherwise 


*~^V-L,V2 ~ 


\^X,V1,V2 ) X £X ' 



(29) 

Note that these definitions are symmetric in v\ and v 2 . Moreover note that Tl Vl ^ V2 (v{) = v 2 and 
n^^ 2 (v 2 ) = vi. 

Consider a formula F G F and a solution r of F. We denote by x^ (t) the set of valuations cr 
assigning each variable x one of the values in the set Ap (t, x): 

Xf(t) = 1[A f (t,x) . (30) 

xGX 

When t is a solution of F, we denote by C (r, F) the set of all renamings of (r, F) ranging in 

Xf(t): 

C(r,F)^{(a,n^ T (F))} (7€xF{T) . (31) 

Lemma 35. If t is a solution of F and a G Xf (t), i/ien a is a solution of G = H a ,r (F) and for 
all variable x G X , Aq (cr, a;) = A^ (r, x). 

Proof. Since G = n CTT (F), cr = 11^.,- (r) and r is a solution of F, by fact [M] we know that cr is a 
solution of G. Moreover by fact[34l for every variable x, Aq (cr, x) = 7r X)CT)T (^4f (t, x)). By definition 
of Xf , a (x) G Ap (t, x). Since n x ,a,T swaps two values r (x) and cr (x) that are both elements of 
A F (t, x), 7r Xj0 - !T (A F (r, x)) = A F (r, x), hence A G (cr, x) = A F (r, x). D 

Lemma 36. The set {C (r,F)} FeF is a partition of C. 

reM-i(F) 

Proof. If (a, G) G C (r, F), then by lemma l35l a is a solution of G. Moreover, by closure of F under 
renaming, G G F. Thus C (r, F) C C. Now it is sufficient to prove that V (a, G) G C there exists a 
unique (t, F) where F £ J, r is a minimal solution of F and (a, G) G C (r, F) . 

• Existence of (r, F): for every x, let r (x) be the minimal value in Aq (a, x) according to the 
order < x underlying -<; by construction, r G xg (cr). Consider the renaming IL^ T and let 
F = Ilcr r (G). By lemma [35l r is a solution of F and for all variable x, Ap (r, x) = Aq (a, x). 
Since for all x G X . r (x) is the minimal value in Ap (r, x), r is minimal for the orientation 
-<. Moreover for all x, a (x) G Ap (r, x), thus a G xf (t); and since G = n ff . T (F), we have 
that (cr,G) gC(t,F). 

• Uniqueness of (j,F): let (t',F') be such that C(t',F') 3 (a,G), i.e. cr G X-F ( r an d 
G = n a T / (F'); then by lemma 1531 for all variable x, Aq (a,x) = Apr (r',x). By minimality 
of t', t' (x) must be the minimum of Aq (a, x) for each variable x. 

□ 
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Lemma 37. Suppose that the weight wf is obtained from a unitary and instance independent 
generator uj. Let (r, F) be an element of C; then X)( CT g)6C(t F) Wg (&) = 1- 

Proof. First note that by lemma l35l for all (a, G) G C (r, F), we have Ac (c, x) — Af (t, x). Thus: 

J2 w G {&) = y, \[wg{o-,x) 

(ct,G)GC(t,F) (a,G)eC(T,F)xeX 

= y I w (x, a (x) , Aq (<t, x)) since a; is instance independent 

(ct,G)£C(t,F)x£X 

= y. If w { x i a ( x ) > Af (t, a;)) since Aq (a, x) — Ap (t, a;) 

= II E w(x,cr(a;),A F (r,a;)) 

rrGX cr(a;)GA_F(r,2;) 

= I | 1 since ui is unitary 

xex 
= 1 . 

□ 

Theorem 38. Let T be a set of CSP instances which is closed under renaming. Let wf be a 
weighting system built from a unitary and instance independent weight generator u). Let -i, be an 
instance independent orientation. Then it holds that X^fs-F l-Mx {F)\ = 7 (J 7 ). 

Proof. It is a mere combination of lemmas [36] and 1371 

FeF 

= E E ^w 

FeFo-GS(F) 

= J] PF F (a) 

0,F)eC 

J^ J^ IVg(o-) by lemma [SB] 

F£J (er,G)eC(r.F) 



= \_. 1 by lemma |3"T1 

rGAI^F) 



fgf 



D 



Closure under renaming involves symmetry, so it is not surprising that on average all weightings 
on the one hand and all orderings on the other hand should be equivalent. What is more surprising 
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though, is the fact that weightings and orderings are equivalent. This is noteworthy because weights 
are simpler to handle in calculations (they yield more compact and tractable formulas, see Boufkhad 
and Hugel [HI). 

7. Conclusion and Perspectives 

Through our Weight Conservation Theorem we gave sufficient conditions to have a correct 
weighting on solutions of CSPs. We were able to apply it to two different weightings: the first one, 
which is very general, was built from seeds and dispatchers; the second one was specifically designed 
to improve on Maneva et al.'s weighting. Thanks to this new weighting scheme, we obtained an 
improvement on the upper bound on the existence of non-trivial cores obtained by Maneva et al. 
to 4.419. 

We also showed an equivalence between weighting and ordering over a set closed under renaming 
when they are instance independent. On the contrary, when weighting and ordering may depend 
on instances, we showed that given an homogeneous weighting it is possible to find an ordering 
which is not worse, but what happens for heterogeneous weightings? is it always possible to find 
for a given weighting a corresponding ordering? 

Other perspectives include: is it possible to define a correct non-uniform weighting? how to gen- 
eralize boolean partial valuations to general CSPs? how to extend our weighting when considering 
neighbors of neighbors, or more generally neighbors at bounded distance? 
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Appendix A. Proof of Theorem 1191 

Appendix A.l. First Moment of Cores 

We strongly advise the reader to read Maneva and Sinclair [1] before reading our calculations, 
since we reuse all of the notations and arguments from there. So we only highlight the similarities 
and the differences. 

The calculation of Maneva and Sinclair [1] works in two steps: 
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1. compute an upper bound function on the first moment / of covers and discard the range of 
variables where / < (because if there are no covers, then there are no cores either); 

2. compute an upper bound function of the first moment / + h of cores and maximize it on the 
remaining domain of the variables. 

In particular they introduce some variables s, t, u\ to these we add v. 

s: the size of the cover or core i.e. the number of variables in a controlled self-constrained set (they 
get a weight of 1); we denote them by symbols Xi where i G SC = {1, ..., s}; 

v. number of invertible variables (they get a weight of — ^); we denote them by symbols Xi where 
iel = {s + l,...,s + t + v}; 

u — v. proportion of starrable but non-invertible variables (they get a weight of 1 — p); we denote 
them by symbols Xi where i G SNI = {s + t + v + 1, ..., s + u}; 

t — u: number of non-starrable variables not in the previous self-constrained set (they get a weight 
of 1); we denote them by symbols Xi where i G NS = {s + u + 1, ..., s + i}; 

n — s — t: number of variables at value * (they get a weight of p); we denote them by symbols Xi 
where i G S = {s + 1, ..., S + 1 — u}. 

p is the probability for a clause of type 3 to be included in the Poisson model. In table lA.il we sum 
up all possible types of clauses in order to count them. We assume by symmetry that we have an 
assignment with values in {0, *} only (no l's). Note that clauses of type 1 and 2 are the same as in 
Maneva and Sinclair [l| and are used in the expression of / rather than in the expression of h. 
As Maneva and Sinclair [lj did, we first define the following quantity: 

Q = (1 -p)(l)+ S ( t 2)+ 2 ( n - S - t ')(( t 2)+ St )+ U (( t 2)+ St )+ 2v( . n - S - t ')( S + t ) 

■ (l - (1 - p )G)+") t_u (l - (1 _ p fns-tKs +t) y-« 
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Table A.l: Clauses and their sizes. 



Types 


Clause 


Sets of subscripts 


Size 


Status 


1 


(1/? V ti-> 7 V Ju L* 


i,j,k€SC 


(S) 


forbidden 




i,j(=SC, keV\SC 


2 (n -*)(*) 


forbidden 


2 


Xi V Xj V ~x~k 


i,j, k G SC 


(5) 

for each Xfe 


At least one 
for each k G SC 


3 


xi V Xj V Xk 


ieSCUNSUlU SNI, 
j,k€NSUlU SNI 


($) + «(£) 


forbidden 


*h V V ■X' 7 V ^u \c 
*X/ ■} V <a^ 7 V ^u L* 


ieSCUNSUlU SNI, 

j e NS U I U SNI 

keS 


2(n-s-t)((2) +si) 


forbidden 


Xi V Xj V Xfe 


ieSCUNSUlU SNI, 

j e NS U I U 5WJ 

fc e I U £W J 


u ((5) + St) 


forbidden 


XiVxJVxk 
Xi\/x~j\/~x~k 


i G 5C U TVS' U I U SAT 
fees' 


2 (n - s - t) V (s + 1) 


forbidden 


Xi V Xj V Xk 


ie SCUNSUIU SNI 
j G NS U I U SA/ 

fee as 


(l)+st 
for each Xk 


At least one 
for each k G NS 


*h ■! V *// 7 V Ju L* 
*h ■} V *Ju 7 V <a^ L* 


i G SC U TVS U / U SNI 

j G SNI 

keS 


2(n-s-t)(s + t) 
for each Xj 


At least one 
for each j G SNI 
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Now with Q we can write down the first moment of cores having n — s — t variables at value *: 



n — S \ _j v—s , >,, / t \ x— v _ .. / M 



= f—t ( n - S \ 2 * (1 _ p P+s(X)+^s-m) + ^) 

■ J2 a - p) u (*) (i - P ) u ^) +at ) (i - (i - p)(^ +st ) t_ " 

u=0 ^ ' 

• y^ 2~ v ( u ^\ (i - p) 2, '("- s - i )( s+t ) (i _ (i _ p) 2 («- s -*)(^ 

= pn-.-t (^ - A 2 * (1 _ p) (t)+s(l) + 2(n-s- t )((l) +S t) 

* /jA t -,. / /i \2(n— s-t)(s+t) \ u 

E (i - p)" ( ) (i - ?) u(G)+st) (i - a - p)®-*) ( i - ( p) 2 ) 



u=0 



■ (a - » a - ,)<»« (i - (1 -" )2 ';'""' ,+ " ) + 1 - a - *><•>") ' 
Yi- a - P )a)+ st f p+ ii^l ( i _ p) 2(n-.-t )(s+ t)\ y 

s = LanJ, * = 6n and p = ^ff^l,,)) +0 (£), so: 

lnEZ t , /(l-a) 1_a 2y-°- 6 \ a (1 - d) 6 (b (6 - 56 - 3a) + 12 (1 - a - 6) a) 

/j = hm = In ' 



n^T+oo n '" I 6 & (1 -a-6) 1_a_b / 2 (4 -a 2 (3 -a)) 

-&m(l-e _J<l (p- 



,-A I „ 1_ P„-B 



where ^ = 3 ° ( M^ b+ ^ ) and B = 6°(i-<0(i-°-6)(°+6) , 

2(4— 0^(3 — a)) (4 — a J (3 — a)) 

Appendix A. 2. Maximization 

Just like Maneva and Sinclair [l|, we want to show that for all a £ [4.419, 4.453], a £ [ 4 4 g 3 3 , l] 
and r > 1, when p (a) = 0.3758a + 0.7067, either / (a, a, r) < or for all 6 £ [0, 1 - a], f (a, a, r) + 
h (a, a, r, p (a) ,6) < 0. Note that our function / is the same as in Maneva and Sinclair [l|; only h 
differs. 

As in Maneva and Sinclair [l|, if r < 1.2 then -^ > 0, and if r > 670 then -J^ < 0, so we are left 
with the region a £ [ 44 g 3e a , 0.999], r G [1.2, 670]/The points where / (4.419, a, r) > -0.0001 are 
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Figure A. 3: / (4.419, a, r) > -O.OOOl inside the contour line, so only when a g [0.28,0.75] and r g [1.4,14], 



depicted in figure \A~3[ This corresponds to the domain where we must check that / + h is negative. 
We get bounds slightly different from Maneva and Sinclair [l|: a <E [0.28,0.75], r € [1.4,14], 
a G [4.419, 4.453] and b G [0, 1 - a] . 

Figure E3] gives the shapes of our / and f + h at a = 4.419 and at a = 4.453. 

The domain of the variables is a finite product of segments, all functions involved are smooth, 
except at the boundary points where b = 1 — a (but this is due to the asymptotic equivalent we used 
for the binomial coefficient). So in order to maximize / + h, following again Maneva and Sinclair 
[l|, we performed a sweep over this domain with a step of 0.001 on all variables (and a step of 10~ 5 
in the vicinity of the maximum). In the end we checked our result using the FindMaximum function 
of Mathematica. The maximum oi f + h is —0.0000277225, and the values of the variables at this 
point are a = 4.419, a = 0.678206, 6 = 0.0299196 and r = 1.79833. 
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Figure A. 4: / and / + h at a = 4.453 and at a = 4.419 (with our weights). 



